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This work reports a quantitative analysis to predicting the efficiency of distributed computing 
running in three models of complex networks: Barabsi-Albert, Erdos-Rnyi and Watts-Strogatz. A 
master/slave computing model is simulated. A node is selected as master and distributes tasks 
among the other nodes (the clients). Topological measurements associated with the master node 
(e.g. its degree or betwenness centrality) are extracted and considered as predictors of the total 
execution time. It is found that the closeness centrality provides the best alternative. The effect of 
network size was also investigated. 

I. INTRODUCTION II. METHODOLOGY 



Despite all the scientific and technological advances in 
scientific computing obtained along the last decades, the 
demand on intensive number crunching remains as high 
as ever. This has been caused by at least the three fol- 
lowing factors: (i) increasing amount of data being con- 
tinuously produced in virtually all areas; (ii) the higher 
resolutions of such data, often involving 3D images and 
movies; and (iii) the focus on complex, non-linear sys- 
tems that has underlain much of the basic and applied 
sciences. Given that the individual processing power has 
saturated [T], ever increasing attention has been placed 
on distributed and networked systems. One particularly 
attractive possibility, known as grid- computing involves 
the use of a large number of computers, duly connected 
to the Internet, as a massive computing resource [2]. The 
immediate advantage of such an approach is to provide 
the access to a large mass of already existent machines, 
without much additional cost for hardware or software. 
However, the performance of a grid system will depend 
strongly on the protocol for distributing the tasks, the 
topology of the interconnections, and the type of process- 
ing required. As a matter of fact, the execution of dis- 
tributed tasks in a grid network is a largely complex prob- 
lem directly related to the structure/function paradigm 
in complex networks [3j. Previous approaches at trying 
to understand the efficiency in grid computing include 
a study of the effect of network connectivity and of 
geographical constraints [5^ on a simple grid computing 
model and how topology affects applications with real- 
time constraints [5]. Despite such recent advances, the 
investigation of how the topology of the interconnections 
on the respective performance remains an open problem. 
In particularly, it would be interesting to devise means 
to predict the execution time of grid computing systems 
from simpler network measurements, without the need of 
undergoing the whole process of real computations. The 
present work aims precisely at investigating how several 
types of networks measurements can be used to foresee 
the execution time in grid computing systems as mod- 
eled and simulated in terms of model complex networks 
of varying topology. 



A. Grid Representation as a Complex Network 

In this work, computing grids are modeled as graphs 
where the nodes are computers and the edges are two- 
directional internet-like connections (e.g. TCP/IP). This 
gives rise to an undirected complex network with N 
nodes, that can be represented by a respective adjacency 
matrix, A. The element of the i-th row and j-th column 
of the matrix, denoted by A^j , has value 1 if and only if 
there is a link between the i-th and the j-th node, and 
the value 0, otherwise. 

The main issue in the current work is to choose a com- 
puter that will distribute the tasks to all other computers 
in such a way as to minimize the total execution time. 
It is natural to suppose that this optimum node should 
be, in some sense, central to the grid topology. With 
this in mind, the following metrics have been chosen in 
order to characterize the centrality of each node: degree 
centrality, betweenness centrality and closeness centrality 
(related with the average distance). Another important 
measure was included: the local clustering coefficient. All 
these metrics can be derived from the adjacency matrix 
A. 

Consider a node i. The degree centrality (also known 
as node degree) gives the number of adjacent links to i 
and is easily calculated by taking the number of non-zero 
elements in row i: 

The betweenness centrality tries to quantify how much 
a node is in the path connecting two other nodes. Con- 
sidering all shortest paths connecting all pairs of nodes, 
the betweenness centrality of i is related to the number 
of these paths passing through i and defined as 



where Sij is the total number of shortest paths between 
nodes j and k, and Sjk{i) is the number of those paths 
that pass through node i. In this work we used the algo- 
rithm of Brandes |5] for its computation. The closeness 
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centrality of node i quantifies how close it is to the other 
nodes in the network and is taken as the reciprocal of the 
mean of the distances from i to all other nodes: 

N 



where is the distance between nodes i and j and is 
taken as the number of edges of the shortest path con- 
necting them. Finally, the local clustering coefhcient of i 
is defined as the number of links connecting its neighbors 
divided by the number of links necessary to fully connect 
all those neighbors (the number of links necessary to con- 
nect each and every pair of neighboring nodes) , leading 
to a number between and 1: 



kj (kj 



where is the number of links among neighbors of node 
i. 



B. Network Models 

Three complex network models were chosen for this 
work: Erdos-Renyi (ER), Barabsi- Albert (BA) and 
Watts-Strogatz (WS) 0. 

In the ER model used fixes the number of nodes in 
the network N and the probability p of creating a link 
between any two nodes. 

The Erdos-Rcnyi model is interesting for comparison 
and analysis but it lacks some properties found in real 
computing networks. Preferential attachment models 
like the Barabasi- Albert (BA) are more realistic, because 
they generate networks with degree distributions follow- 
ing a power law, very similar to those found in the real 
Internet |10j . Considering that many grid computing 
infrastructures are built upon the Internet or upon an 
Internet like network, the BA model is included in our 
analysis. BA networks are built by starting with a few 
unconnected nodes; for each new node created, m con- 
nections are generated from this node to others in such 
a way that nodes with higher degree are more likely to 
receive one of the m connections. The resulting network 
average degree will be equal to 2m. 

As another example of a network model that has 
different properties than the aforementioned, we chose 
the Watts-Strogatz (WS) model which implements the 
"small- world effect." Short distances are frequently de- 
sirable in grids in order to minimize the communication 
latency among nodes. This model combines clustering 
coefficients that are higher than those in the BA and 
ER models with small distances. It starts with a regular 
structure such as a ring, with average degree k and then 
rewires some edges with uniform probablity r. Larger 
values of r decrease clustering and average distances, but 
distances fall much faster. It is therefore possible, by ad- 
justing r, to see the effect of high clustering and small 
distances simultaneously. 



C. Computational model 

The computational model used is the same introduced 
in Ref. [4]. The application works as master/slave, with 
a master having a set of tasks that are distributed to the 
slaves for processing. All tasks require the same amount 
of time to compute. After choosing one of the nodes as 
the master, all other nodes in the network are slaves. 
Figure [T] shows a diagram of the computational model, 
which illustrates the time necessary for the slaves to send 
a message to the master. The processing starts with the 
slaves sending a task request to the master. For each 
request received, the master sends a task to be computed. 
After the slave finishes computing the task, the result is 
returned to the master, which sends another task to the 
slave, if there are yet tasks to compute (see Figure [T]). 
When the results of all tasks are received by the master, 
the application stops. 

We consider that the time to send a task or result 
through each link is the same, and use this as the time 
unit. We also assume that the time taken by the master 
to choose a task to send and by each node to route packets 
to a neighbor node are negligible. Also, all nodes have 
the same computational power. 

At the start of the execution, a slave that lies at a 
distance d from the master will need to wait for a task 
for 2d time units {d units for its request to reach the 
master and another d for the task to arrive). Suppose 
the node takes c time units to compute the task. The 
master will then receive the result for the first task from 
this slave at 2d + c + d = id + c. Subsequent task results 
will arrive from this slave at intervals of 2d -f c (because 
the arrival of a result is a sign that the slave is idle, and 
the master can send a new task). If the total number 
of tasks is sufficiently large, all slaves at distance d from 
the master will receive approximately the same number 
of task to compute; we represent this number of tasks as 
m{d), as it depends on the distance. They will therefore 
take a total time of 



T(d) = d + m{d){2d + c). 



(1) 



Again if the total number of tasks is sufficiently large, 
all slaves will end processing approximately at the same 
time, such that we can consider T{d) in Equation [T] as 
the total execution time, but we will further simplify the 
expression because in this case m{d){2d + c) is large with 
respect to d, giving: 



T = m{d){2d + c). 
If we have a total of M tasks to execute we get: 



M = £ n(d)m(d) = £ 



d=l 



n{d)T 
2d + c 



(2) 



(3) 



where n{d) is the number of nodes at distance d from 
the master and c?max is the maximum distance from the 
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(a) Computational model (b) Steps of task execution 

FIG. 1: (a) Diagram showing the computation model, each connection requires the same amount of time (one unit), 
(b) Steps of the model, 1) slave sends a task request to the master, 2) master sends a task to be computed and 

3) Slave returns the result to the master 



master for all nodes. Isolating T in Equation |3] we get 

M 



TABLE I: Average (30 samples) Pearson coefficient 
value 



T ■ 



(4) 



If we have c ^ 2d, Equation |4] reduces to 

Mc 
iV- 1' 

where N is the number of nodes in the network. This 
is the best possible result for this application, and is 
achieved independently from network topology. On the 
other hand, if c <C 2d, most time is taken sending and 
receiving tasks, and it will make no sense to use a grid 
computing system (better results would be achieved by 
local execution of the tasks). The interesting values are 
then for c ~ 2c?, where total execution time depends on 
topology. More specifically, the results depend on the 
hierarchical structure of the network 



III. RESULTS AND DISCUSSION 

In order to quantify the overall processing perfor- 
mance, we adopt the concept of efficiency [12,. This 
measurement is calculated dividing the achieved speed- 
up by the ideal speed-up. Speed-up refers to the ratio 
between the total execution time with one processor and 
the total execution time using all the processors in the 
network. Therefore, the ideal speed-up is N , i.e. the num- 
ber of available processors (nodes). 

The performed experiments proceed in three stages. 
First we compare the potential of several network topol- 
ogy measurements in predicting the grid efficiency for a 
BA configuration. Then, we investigate the effect of net- 
work size in the prediction. Finally, we consider other 
network models, more specifically the ER and WS mod- 
els. 



Measure 


Average Pearson 


Degree 


0.611 


Clustering 


0.212 


Closeness 


0.918 


Betweenness 


0.514 



A. Preliminary Analysis 



In order to have an initial indication of how several 
measurements of the topology of the networks influ- 
ence execution time, we considered a BA networks with 
N = 1000 and (fc) >= 6 (the latter reflects the trend 
observed in the real internet [H]). The individual tasks to 
be performed were assumed to be 10 (in the choosen time 
unit that corresponds to the time to send a task through 
a network link). Each simulation involves 10000 tasks. 
We obtained the execution times for 30 network configu- 
rations, which had their topology characterized in terms 
of their average degree, clustering coefficient, betweeness 
and closeness centralities. Figure [2] shows the process- 
ing efflciency in terms of each of these measurements, 
with respective Pearson correlation coefficients collected 
in Table |l] It is clear that the closeness exhibited a signif- 
icant correlation with efficiency and the largest Pearson 
coefRcient when compared with the other measurements, 
suggesting that it can be used to provide accurate pre- 
dictions of the processing efficiency. For this reason, we 
henceforth focus on the closeness in this work. 



4 



7.0 

6.8 

6.6 

6.4 

^ 6.2 
o 

g 6.0 

M 5.6 
5.4 
5.2 
5.0 
4.8 



40 



Pearson Coefficient=0.602 

120 ] 



7.0 
6.8 
6.6 
6.4 
6.2 

u 

g 6.0 

W 5.6 
5.4 
5.2 
5.0 
4.8 



60 80 100 

Average Degree 
(a) Efficiency X Average Degree 



6, 

S 6, 



III^^^M^wwillllll III iiiiiiiiiiii 



IIIIIIIIIIII II I II I III m ill 



Pearson Coefficient=0.504 





7.0 




6.8 




6.6 




6.4 




6.2 


u 






6.0 








5.8 






w 


5.6 




5.4 




5.2 




5.0 




4.8 



40000 60000 80000 100000 

Betweeness Centrality 
(c) Efficiency X Betweeness 



120000 14000C 















^^Mll|]|i|i| II III 1 

■nnirtiitiitit itii 

Miiiiiiiiiiiiiiiiiiii 1 1 

Mlllllllllllll Mill II TT 


+ + + 

iM- 1 + 
4+f ++ + + 


+ - 
+ + + 




- + + + 

- + + 


+ + 
+ 


- Ilium III + 4 




+ 


Pearson Coefficient=0.221 


0.2 

(b) Efficiency 


0.4 0.6 0.8 
Clustering 

X Clustering Coefficient 



Pearson Coefficient=0.918 



0.30 0.35 

Closeness 



(d) Efficiency X Closeness 

FIG. 2: Efficiency x Measure for Barabasi model with 1000 nodes and average degree 6 



B. Effect of Network Size 



Effect of Network Models 



The previous experiment was performed with respect 
to BA networks with fixed sizes N = 1000. Now we 
investigate the effect of N on the predictive power of 
closeness. Figure [3] shows these results. The Pearson 
correlation coefficient decreases monotonically with N, 
but remains high even for N — 5000. The quantization 
effect in these results are an artifact implied because all 
tasks have the same computational load, all nodes have 
the same computational power, and all links transmit 
packets at the same rate. With our chosen parameter 
of 1 time unit for packet transmission and 10 time units 
for task computation, nodes that are at distance 1 from 
the master end their tasks at instants 13, 25, 37, 49, etc. 
while tasks from nodes at distance 2 end at instants 16, 
30, 44, 58, etc. and from nodes at distance 3 at instants 
19, 35, 51, 67, etc. When the total number of tasks is 
not large with respect to the number of nodes, only a few 
values of execution time are be possible. 



In order to investigate the effect of network topology 
on the efficiency, we performed simulations for ER and 
WS (for several values of p) models for N — 1000 and 
m = 3. The results are show in Figure |4] The Pearson 
correlation coefficients resulted very high for all cases. 
The same quantization effect already commented above 
appeared for the ER and WS with r = 0.1. This is a 
direct consequence of the fact that these configurations 
have smaller diameter. The highest efficiency values have 
been observed for the ER configuration. 



IV. CONCLUSIONS 

The proliferation of concurrent processing systems in 
the last decades as an alternative to achieving high pro- 
cessing speeds has emphasized the need not only to quan- 
tify the obtained improvements, but also to try to predict 
the respective performance. Grid computing has been 
receiving growing attention along the last years because 
of its potential to harness available processing capabili- 
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FIG. 3: Efficiency x closeness for varfous network sizes (BA model, to = 3) 



ties in a flexible and efficient manner. In this work, we 
set out to investigate how the efficiency of grid systems 
can be predicted while taking into account several topo- 
logic measurements of the topology of the grid network. 
First, we quantified, through the Pearson correlation co- 
efficient, the potential of several measurements from com- 
plex networks to predict the efficiency of BA networks, 
finding out that the closeness provided the best alter- 
native. Then, we checked out the influence of the net- 
works size (i.e. number of nodes) on the prediction of 
the efficiency. Despite a quantization effect appearing 
for large sizes, all tried situations yielded very high Pear- 
son coefficients. Finally, we also found that extremely 
good predictions can be obtained even for other network 
topologies such as ER and WS. The high values of Pear- 



son correlation coefficients obtained for most tested cases 
allow us to choose the best master node from which to 
distribute the task, i.e. the node with the lowest average 
distance to other nodes would provide the best choice for 
this finality. Future works could investigate task distri- 
bution schemes other than the master-slave configuration 
addressed in the present work. 
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